ALIZE/spkdet: a state-of-the-art open source software for speaker recognition
نویسندگان
چکیده
This paper presents the ALIZE/SpkDet open source software packages for text independent speaker recognition. This software is based on the well-known UBM/GMM approach. It includes also the latest speaker recognition developments such as Latent Factor Analysis (LFA) and unsupervised adaptation. Discriminant classifiers such as SVM supervectors are also provided, linked with the Nuisance Attribute Projection (NAP). The software performance is demonstrated within the framework of the NIST’06 SRE evaluation campaign. Several other applications like speaker diarization, embedded speaker recognition, password dependent speaker recognition and pathological voice assessment are also presented.
منابع مشابه
ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition
ALIZE is an open-source platform for speaker recognition. The ALIZE library implements a low-level statistical engine based on the well-known Gaussian mixture modelling. The toolkit includes a set of high level tools dedicated to speaker recognition based on the latest developments in speaker recognition such as Joint Factor Analysis, Support Vector Machine, i-vector modelling and Probabilistic...
متن کاملApplication of automatic speaker recognition techniques to pathological voice assessment (dysphonia)
This paper investigates the adaptation of Automatic Speaker Recognition (ASR) techniques to the pathological voice assessment (dysphonic voices). The aim of this study is to provide a novel method, suitable for keeping track of the evolution of the patient’s pathology: easy-to-use, fast, non-invasive for the patient, and affordable for the clinicians. This method will be complementary to the ex...
متن کاملInter and Intra-speaker Variability in French: An Analysis of Oral Vowels and Its Implication for Automatic Speaker Verification
Intra and inter-speaker variability is studied as a way to better understand how voice can be used as biometric data. Formant values from 328,016 exemplars of the 10 French oral vowels uttered by 111 speakers were compared to estimate their speaker discrimination power. The vowels /œ/, /ɛ/ and /a/ appear to convey more idiosyncratic information than other oral vowels. A more comprehensive phone...
متن کاملThe RWTH aachen university open source speech recognition system
We announce the public availability of the RWTH Aachen University speech recognition toolkit. The toolkit includes state of the art speech recognition technology for acoustic model training and decoding. Speaker adaptation, speaker adaptive training, unsupervised training, a finite state automata library, and an efficient tree search decoder are notable components. Comprehensive documentation, ...
متن کاملAn Attack on a Text-independent Speaker Authentication System
We mount an effective attack on a third-party open-source text-independent speaker verification system. Specifically, we show how an attacker can simply use a signal generated at fixed frequency to pass speaker verification and gain access to other user accounts. We demonstrate this attack on the GMM-UBM based ALIZE speaker verification system using the YOHO voice database. We show through expe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008